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< ABSTRACT . - I 

Pffe-enr oilmen t dn*racteri?tics that distinguish 
non persisting students from students who 'persist into the .second 
semester of their freshman year were studied at the Onivetsity of 
T<ajra it* 197B. Three sources of information were used to obtain * 
psed^rtor variables: American College Testing (ACT) prcgram test 

^scoreF,, the ACT Student Profile, and high school transcript . 
.information from the registrar. Study objectives wer^ tc reduce the 
large number pf pre- enrollment variables to a small .hunger of useful 
prediction variables and to use linear and nonlinear classification 
techniques to exhaust the predictive power of 

the predictor variables* Of the 2 , tSO"* entering freshmen in the fall 
197 8 cohort, there was enough information to allow factor scosmgj, for 
1,711 of the students. Jt.was found /that the difference between using 
i*3 factors' and 23 factors* was negligible, Ttiere.were no indicati<»s 
from t*is study or previous. stu£i*« that a larger nujibef of the » ^ 
pre-enrollment variables could account for more than variant of 
persistence; however, the 23 factors did not, predict retention any 
better than an earlier study that used only three variabl'es^It^is 
concluded that pre-enrollment variables perse did not directly 
predict persistence* It is recommended "that the smallest .set of. ^ 
factors -that maintain? the/£urrent level of prediction cf persistence 
should be developed -and 9 used in retention studies to -save on costs of 
.* data storage and analysis-. *(SB) 
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XlflOMCfZOI / 

To increase fcheir, probability of survival, colleges and 
universities need to increase student enrollaent. Student 
enrollaent ds increased* by reducing the proportion of sta- 

dents who withdraw^or by increasing the noaber of incoaing; 

,» 

students. The retention of current students is aore cost 
effective than raising the number, of adaissions. Suaaers- 
kill (1962) and Pantages and Creedon (1978) have shovn th$t 
50* of an institutions entering students do dot graduate 
''froa that, institution. ' forty percent of this non-persisting 
group transfer and eventually graduate froa another institu- 
tion. The regaining 60t of the non-persisters never gradu- 
ate. The departure of such a large nuaber of students is 
.very costly to the institution in ^«es of lost revenues and 
to nany of 'the students interns of satisfaction with their 
educational experience. If potential no n- persisting stu- 
dents could be accurately identified, institutions would 
ha,ve the opportunity to target these students for additional 
h'lp (advising, counseling, adjustaent progress, etc.) and 
thereby increase student retention. 

The tera retention has been defined in teres of the 
period of tise used ,t6 classify persisters and non-persis- 
ters. Definitions oAreteniion for classification of per-* 
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'sisters and non-persisters liave tanged f roe continued en- 
rollment otbc one seaester foF first seiester freshaan to 
enrollment' for .a 1 perffSd of A and 5 years or graduation. -Be- 

tention research at The University of Iowa has encoapassed 

*** ."* 
the entire range of definitions. Unfortunately, none of $ 

these studies were able to, accurately deteraine the* varia- 
bles predicting retention or accurately classify students- as 
persisters or non-persisters. 'The lack of success of these 
studies aay have been due to the heavy use of pre-enrollaent 
data such as ACT scores and High School transcript inforaa- 
tion. This study focuses only on retention in the first - 
seiester of college. This choice of the Uae period would 
presumably allow the pre-earqllaent data to have its aaxiaua 
effect. The usual first seaester rate of non persistence at 
The University, of Iowa is about 7%. although this is not as 
high as the second seaester nonpersistence rate, the second 

s 

.^seaester' was assuaed ,to be influenced pore by factors other 
than the pre-enrollaent data. . 
Specific objectives of this study were* to : ^ 
(1) identify pre-enrollaent characteristics that dis- 
^ tinguish no*- persisting students froa students who 
persist into the second seaester of their freshaan 
year, 

,(2) reduce the large nuaber of prerenrpllaent varia-. 
bias to a snail nuaber of useful prediction varia- 
bles, and 



(3) use linear and nonlinear classification techniques 
to thoroughly exhaust the predictive power of the 
♦ prediction variables. 



rasamua, 

geblocts 

The- Fall 1978 class of entering freshsan at The Univer- 
sity of Iowa served as the prinary saaple in. this study, 
about 7* of the 1978 cohort dropped oat before the start of 
the second sesester ofjthe 1978-1979 acidesic year. This 
rate is sisilar to those-observed it past y^ars. The 1978 
cohort was grouped as ffollovs : 

"-Group a: "lon-persisters" the 7* of the freshsen 

vho left the university by the second sesester of 
S \t~leir freshsan y*ar; 

-Group B: "Persisters" r^ttose freshsen in the 

* 1978 cohort who started the second sesester of 

their freshsan year.^ 
Separate analyses were not done fpr sales and resales 
due to the seal! saaple sixe of group 1. previous* studies 
hate indicated^ that there is no. difference in persistence 
for sales and females. The Pall 1979 class* of entering 
freshian at The university of Iowa served as the cross 
da t ion group. 
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Three sources of information vere need in this stady to 
obtain predictor variables : ACT tast scores, tha 1CI Stu- 
dant Prdfile, and high school transcript information froi 
tha University Begistrar. the 1CT testa aaasara acadeaic 
skills in English, aatheeatics, natural science, and social 
science. 1 composite test score based oni pacforaanca in 
these areas is also provided, the Studant Profile Bepoct 
provides information about the student 1 * background, past 
accoaplishaents, special needs, and future educational/voca- 
tional objectives, lido provided in tha Student Profile Ber 
port are special scores froa an 1CT Interest Inventory, in- 
foraation used froa the high school transcript includes , , 
grade point average, claaa rank, and the sixe of^the gradu- 
ating class, in aost cases, the high achool transcript in- 
foraation and all tie iCT data sere^ provided to The Univer- 
sity of loaa as a reguireeent for adaisaion. Osing data 
directly froa the above sourcea, nine coablned variables 
«ece created. . These variables were aiaple euaa'of binary 
response' iteas relating t* specific areas of students' ex- 
tracorricular activities in high" school. '- * 

4 . •* ' • ~' • - ■ . • 

v 
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i pearaon correlation* eatrix «aa produced using the 

pre-enrollaent intoraation froa.*he 1*78 cohort, lisaing. 

fames vere not replaced ^anile developing ttoi correlation 



■atrix. As a result, *he nuaber of cases used to calculate 
the correlations ranged froa 46 to 2842, depending on the 
variables' correlated. The correlation-%i*rii foraed the ba- 
sis for the subsequent calculations, 

A Principal Components analysis vas .performed on the 
correlation aatriz to* produce eigenvalues and to deteraine 
the portion of Variance each factor contributed to the ove- 
rail variance. Hased on *ihese results, two sets of factors 
were chosen. The first set consisted of the fewest nuaber 
of factors that coold account for a substantial proportion 
of the variance, arbitrarily determined to be 65*. Forty-^ 
three factors were 'retained using the -above criterion. If a 
ainieua eigenvalue of 1 been qsed as the criterion, forty* 
oije factors would hate be^n .retained. The second set con- 
sisted of those factors that contributed at least 1.1* to 
the total Variance. The criterion vas chosen after reel- 
ing the criterion produced using the Scree aethod q£ factox 
selection/ The second set of^factors vas produced because 
forty-three factors was considered too large a nuaber of 
factors , to retain. Twenty-three factors were retained using 
this second approach* < 

Both sets of factor 9 were rotated using varieax rota- 
tion and factor scores were calculated for each student. 

Two discriainant classification functions were calculated 

• # 

using the two different sets of factor scores. The classi- 
fication variable was first seaester persistence, coded as a 



zero or a one. k test of homogeneity was nade for eadh 
function to determine whether the pooled doviriance , fatrice* 
for the v d.iscrieinant functions could be validly used. 

• Cross validation vas performed on the 1979 cohort. The 
factor scoring patterns f ron each factor analysis were used 
to calculate, the factor scores for the 1979 cohort. Both 
the 43- and 23-factdr scores were obtained. Toe discrimi- 
nant functions calculated for the .1978 cohort, were then ap- 
plied to the 1979 factor scored data. 

Finally, in order* to account for. possible nonlinear ity 

* . 

in the data, a near neighbor analysis^was performed on the 
43-factor data. Only part of the analysis 4as completed die 
to theF high cost. The incomplete analysis yielded poorer 
results than .the discriminant results on the calibration 
dat*. • , 

\ . v 

^ These were 2,850 entering freshman at The* University of 
Iowa in the fall 1978 cohort. Although 132 pre-enrollaent 
variables were theoretically available for each student, 
there were aissing values. Eleven of the pre-enrollaent va- 
riables had less ^han 2000 respondents. . To increase the 
number of usable cases in both the 1978 and 1979 cohort, av- 

erages were substituted for aissing values for these eleven 

• . • -. • ■ , 

variables. Thus 1,711 of these students had enough inforaa- 
tion to allow factor scoring.' 



. " The results of the first factor analysis used to pro-, 
dace eigenvalues and to de^ersine' portion of variances are 
given in Table 1. . The sets of 43 and 23 factors were ob-*- 
tained froi this factor analysis. - ■ 

The'xotated 43-f actor ,set accounted for 65* of the ove- 
rall- variance. The calibration data applied to the discri- 

• »*.-_ - * 

■inant function yielded the^results- shown in Table, 2. .The 

accuracy of the prediction was 95/73*, the actual rate of 

retention was 93.73*: The test of hdiogeneity for the with- 

in^variance "satrices was rejected at the p<.001 level with 

a chi-sguare value of 12fi^ and 946 degrees of freedoi. ihen 

the 2801 students of- the 1979 cohort were factor scored',. 

» 

1697 ~of ^Be subjects regained for analysis. These subjects 

• 

were then entered into the discriiinant function. . The re- 
sults of this procedure are shoun in Table 3. The discrimi- 
nant function had an accuracy of only 90.51* when applied. to 
the. 1979 cohort. 

The factor analysis retaining only 23; factors «as also 
varieas rotated'. The aaount of overall variance\accounted 

for was 48*. _ Table 4 shows the results of the calibration ' 

*. * ' 

data applied to the discriiinant function... Table 5 shows the 
cross validation data applied to *he disfcriiinant' function. 
The aceuracy of the calibration data was 92.57*. The acctf- 
.racy of the cross validation data was 89.16*. The hoeogehe- 
ity of within covariance satrices was rejected at the p<.001 
level as the chi-sgu*re value was 489 Mith 276 degrees of 
freedom. ^ 

9 T 
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The discriminant results of the two analyses were sat- 
ginally better than chance for the calibration data. The 
cross validation results are worse than chance. .This fol- 
lows tlie f>a\ttern of previous retention studies at The Oniv- 
er^itj of^Xova. Variables that predict the calibration data 
better th^p chance fail 5 to do so on cross validation data. 

On the basis of the cross validation results, the dif- 
ference between, using 43 factors a^d using 23 factors was 
negligible.. If arty variables are to be used froi the pre- 
enrollaej^ data, the factor scores fro* the 23 factor analy- 
sis shoSK be used. There were no indications froi this or* 
previous studies that a larger^ naaber of these pre-enroll- * 
■eat variables could account for sore of. the variance of 
persistence. Previous studies at The University of Iota did 
no better than the results obtained here. The use of 23 
factors rather than all 132 variables would considerably re- 
duce the costs of analyses using the iCT profile; the iCT 
^scores, and registrar's high school information. Bowever, 
the 23 factors do not predict retention any better than an 
earlier study which used only the 3 variables : High school 
GPi, college preparatory curricullus in High school, and 
student confidence in future academic oc§ vocational plans. 
This seess to indicate that .even 23 factors are too aany and 
- that a saaller set of factors should be used. This result 
. say also indicate that the, pre-enrollnent taiiables say not 
be able to predict persistence with acceptable accuracy. 



• . 9 

In conclusion, it appears that pre-enroll»ent variables 
pet se do not directly predict persistence. Therefore, cor- 
rent research at Iova is based on inforaation sources other 
than the pre-enrollaent variables. The saallest set of facr 
tors that sain^ains the current level of. prediction of- per- 
sistence should be developed and used in retention studies 
to save* on- costs of storage and costs of analysis, but in 
conjunction vith other information. It shotfld be notejl that 
it say be« possible to affect the rate of persistence by in- 

r 

tervention on the basis of pre-enrollaent data, but that has 
..not been investigated here. 
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